Corpus: vec-hr_web_2015_10K

Other corpora

4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word-N-grams for N=1...5 for the first K sentences

K # of words # of bigrams # of trigrams # of 4-grams # of 5-grams
100 92 94 95 95 95
1000 827 937 973 979 985
10000 5710 8424 9349 9629 9710
100000 5710 8425 9350 9630 9711
1000000 5710 8425 9350 9630 9711


Zipf's diagram for sentence endings


Gnuplot diagram

2555 msec needed at 2018-06-30 16:26